108 results found.
Written
text normalisation resources,
Language Type:
Multilingual
Languages:
Chinese
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial- ShareAlike 4.0
Size:
15 KByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>
Speech/Written
Lexicon,
Language Type:
Monolingual
Languages:
Chinese
Availability:
From Owner
License:
Size:
1.9 MByte Production Status:
Newly created-in progress
Use:
Semantic Role Labeling
-
Paper title:Construct a Sense-Frame Aligned Predicate Lexicon for Chinese AMR Corpus
-
Paper track:Infrastructural Issues/Large Projects/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Li Song | Chinese Sense-Frame Aligned Predicate Lexicon | /N |
Documentation:
None
Image and annotated text
Evaluation Data,
Language Type:
Monolingual
Languages:
Chinese
Availability:
Freely Available
License:
forthcoming
Size:
438249 tokens Production Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:Building OCR/NER Test Collections
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | James Mayfield | Renmin-OCR-NER | /N |
Documentation:
Yes, English, Yes
Written
Representation-Annotation Formalism/Guidelines,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
Attribution-NonCommercial-ShareAlike 4.0 International
Size:
1.6 MByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Building an English-Chinese Parallel Corpus Annotated with Sub-sentential Translation Techniques
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yuming Zhai | Annotation Guidelines of Translation Techniques for English-Chinese | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
Attribution-NonCommercial-ShareAlike 4.0 International
Size:
50 sentences Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Building an English-Chinese Parallel Corpus Annotated with Sub-sentential Translation Techniques
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yuming Zhai | English-Chinese parallel corpus annotated with translation techniques | /N |
Documentation:
None
Written
Treebank,
Language Type:
Monolingual
Languages:
Afrikaans Akkadian Amharic Ancient Greek Arabic Armenian Assyrian Bambara Basque Belarusian Bhojpuri Breton Bulgarian Buryat Cantonese Catalan Chinese Classical Chinese Coptic Croatian Czech Danish Dutch English Erzya Estonian Faroese Finnish French Galician German Gothic Greek Hebrew Hindi Hindi English Hungarian Indonesian Irish Italian Japanese Karelian Kazakh Komi Permyak Komi Zyrian Korean Kurmanji Latin Latvian Lithuanian Livvi Maltese Marathi Mbya Guarani Moksha Naija North Sami Norwegian Old Church Slavonic Old French Old Russian Persian Polish Portuguese Romanian Russian Sanskrit Scottish Gaelic Serbian Skolt Sami Slovak Slovenian Spanish Swedish Swedish Sign Language Swiss German Tagalog Tamil Telugu Thai Turkish Ukrainian Upper Sorbian Urdu Uyghur Vietnamese Warlpiri Welsh Wolof Yoruba
Availability:
Freely Available
License:
Various
Size:
25 million words Production Status:
Existing-updated
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joakim Nivre | Universal Dependencies | /N |
Documentation:
https://universaldependencies.org
Written
Lexicon,
Language Type:
Multilingual
Languages:
Bulgarian Catalan Chinese Dutch English Estonian Finnish Italian Portuguese Slovenian Spanish Swedish Thai and Turkish
Availability:
Freely Available
License:
Open Source
Size:
41 411 senses for Bulgarian, 35 820 for Swedish OtherProduction Status:
Newly created-in progress
Use:
Word Sense Disambiguation
-
Paper title:A Parallel WordNet for English, Swedish and Bulgarian
-
Paper track:Written/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Krasimir Angelov | GF WordNet | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Chinese
Availability:
From Owner
License:
Size:
20451337 sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Meng Chen | The JDDC Corpus | /N |
Documentation:
None
Written
Treebank,
Language Type:
Monolingual
Languages:
Chinese
Availability:
From Owner
License:
Size:
500 documents OtherProduction Status:
Existing-used
Use:
Discourse
-
Paper title:Shallow Discourse Annotation for Chinese TED Talks
-
Paper track:Infrastructural Issues/Large Projects/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Xinyi Cai | Chinese Discourse Treebank (CDTB) | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
Size:
650 MByte Production Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:A Chinese Corpus for Fine-grained Entity Typing
-
Paper track:Evaluation/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chin Lee | A Chinese Corpus for Fine-grained Entity Typing | /N |
Documentation:
None




